CDS

Accession Number TCMCG075C20953
gbkey CDS
Protein Id XP_007026413.1
Location join(26187755..26187851,26188325..26188384,26188480..26188551,26188671..26188769,26189554..26189665,26189737..26189814,26189928..26189961,26190035..26190103,26190234..26190308)
Gene LOC18597364
GeneID 18597364
Organism Theobroma cacao
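
The Location field above uses the feature-table join(...) notation, where each start..end pair is one exon segment with inclusive coordinates. As a rough consistency check, the sketch below sums the segment lengths and compares the result with the protein length reported in this record. The names parse_join and spliced_length are hypothetical helpers introduced for illustration, and the sketch assumes a plain join of forward-strand ranges with no complement(...) or partial (< or >) markers.

```python
import re

# Location string copied from this record's "Location" field.
LOCATION = (
    "join(26187755..26187851,26188325..26188384,26188480..26188551,"
    "26188671..26188769,26189554..26189665,26189737..26189814,"
    "26189928..26189961,26190035..26190103,26190234..26190308)"
)


def parse_join(location):
    """Return a list of (start, end) integer pairs found in a join(...) string.

    Hypothetical helper: assumes simple start..end ranges only.
    """
    pairs = re.findall(r"(\d+)\.\.(\d+)", location)
    return [(int(start), int(end)) for start, end in pairs]


def spliced_length(segments):
    """Total spliced length in nucleotides; coordinates are inclusive."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
total = spliced_length(segments)

# Number of segments, CDS length in nt, and codons excluding the stop codon.
print(len(segments), total, total // 3 - 1)  # 9 696 231
```

The nine segments add up to 696 nt, that is 232 codons including the stop codon, which agrees with the 231 aa protein length listed below.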

Protein

Length 231 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007026351.2
Definition PREDICTED: DNA-directed RNA polymerases IV and V subunit 4 isoform X1 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category K
Description DNA-directed RNA polymerases IV and V subunit
KEGG_TC -
KEGG_Module M00180
KEGG_Reaction R00435
R00441
R00442
R00443
KEGG_rclass RC02795
BRITE br01611
ko00000
ko00001
ko00002
ko03021
ko03400
KEGG_ko ko:K03012
EC -
KEGG_Pathway ko00230
ko00240
ko01100
ko03020
ko05016
ko05169
map00230
map00240
map01100
map03020
map05016
map05169
GOs -

Sequence

CDS:  
ATGTCGGAGAAGGGAGGCAAAGGGTTTTCATTGCCCACCAAAACAACACCTAAATCTGCTCTCAAATCTACCCCGGCTTCTGCCACTGCTAGACATGGAAAAGATGATAATTCTGCAAAATCAAAGAGGGGAAGGAAAGTTCAGTTTGGAATGGAAGGTTTACCTAACCTTGGATTTAATTTCTCATCGCCAAAATCTGATGGCAAGTTTGCAATCCCTGTTGGTAAAGGTGACTGGGCCAAGGGAGGAAAGGGAGAAAAGGTGGTCAATGGTGGAAAGGCCCCTGTGGCAAAAGAAGCTAAGTCATTGGAGCTCAGAGTTGAACAGGAACTTCCAGAAAATGTTAAATGCCTCATGGATTGTGAGGCTGCAAATATTTTAGAAGGCATCCAGGAACAAATGGTTATGCTCTCTCAAGATTCAACTATTAAGCTGCCCGAATCATTTCATTTAGGACTGCAGTATGCCAAGACTCGTAGCTATTATACTAATCCCCAGTCTGTCAGACGAGTTCTTGAGGCTCTTTCAAAATATGGTGTCTCTTACAGTGAGATTTGTGTGATTGCAAATACTTGTCCAGAAACTGTTGATGAAGTTTTTGCTCTTGTTCGATCCTTGGAGGCTAAGAAAAGCAGGCTCAGTGAACCACTTAAAGATGTATTGGATGAGCTAGGTAAACTTAAAAAATCCACCTGA
Protein:  
MSEKGGKGFSLPTKTTPKSALKSTPASATARHGKDDNSAKSKRGRKVQFGMEGLPNLGFNFSSPKSDGKFAIPVGKGDWAKGGKGEKVVNGGKAPVAKEAKSLELRVEQELPENVKCLMDCEAANILEGIQEQMVMLSQDSTIKLPESFHLGLQYAKTRSYYTNPQSVRRVLEALSKYGVSYSEICVIANTCPETVDEVFALVRSLEAKKSRLSEPLKDVLDELGKLKKST
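
The relationship between the CDS and Protein sequences above can be illustrated with a minimal translation sketch. It assumes the standard genetic code (NCBI translation table 1), which is the usual choice for plant nuclear genes; the record itself does not state the table. The translate function is a hypothetical helper written for this example, not part of any library, and it is shown here on the first and last few codons of the CDS rather than on the full sequence.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), written in the
# conventional TCAG order, with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Hypothetical helper: expects unambiguous A/C/G/T bases and a length
    that is a multiple of three.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First eight codons of the CDS in this record.
print(translate("ATGTCGGAGAAGGGAGGCAAAGGG"))  # MSEKGGKG
# Last four codons of the CDS, ending in the TGA stop codon.
print(translate("AAATCCACCTGA"))  # KST
```

Both fragments match the corresponding ends of the Protein sequence (MSEKGGKG... and ...KST). Passing the full CDS string to the same function is one way to check the whole record.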